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The functional roles of SNPs within the 8q24 gene desert in the cancer phenotype are not yet well understood. Here, we 
report that CCAT2, a novel long noncoding RNA transcript (IncRNA) encompassing the rs6983267 SNP, is highly over- 
expressed in microsatellite-stable colorectal cancer and promotes tumor growth, metastasis, and chromosomal instability. 
We demonstrate that MYC, miR-17-5p, and miR-20a are up-regulated by CCAT2 through TCF7L2-mediated transcriptional 
regulation. We further identify the physical interaction between CCAT2 and TCF7L2 resulting in an enhancement of WNT 
signaling activity. We show that CCAT2 is itself a WNT downstream target, which suggests the existence of a feedback loop. 
Finally, we demonstrate that the SNP status affects CCAT2 expression and the risk allele G produces more CCAT2 transcript. 
Our results support a new mechanism of MYC and WNT regulation by the novel IncRNA CCAT2 in colorectal cancer 
pathogenesis, and provide an alternative explanation of the SNP-conferred cancer risk. 



[Supplemental material is available for this article.] 

Notwithstanding the considerable advancements in our under- 
standing of the molecular genetic basis of cancer, in the majority of 
cancer-associated genomic regions, the responsible protein-coding 
genes have not been identified yet. The discovery of short (19- 
22 nt), noncoding RNAs (ncRNAs) — called microRNAs (miRNAs) 
(Ambros 2001) — not only revealed a novel mechanism of gene 
regulation but also led to the identification of miRNAs directly 
involved in cancer development (Spizzo et al. 2009). It is therefore 
plausible that as-yet-unidentified members of the broader category 
of ncRNA mapping to cancer-associated genomic regions play rate- 
limiting roles in tumor initiation and/or progression (Rinn and 
Chang 2012). For instance, we previously reported that highly 
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conserved genomic regions (ultraconserved regions, or UCRs) 
(Bejerano et al. 2004) are frequently transcribed as long (>200 bp) 
ncRNAs (IncRNAs) in both normal and tumor tissues (Calin et al. 
2007). Furthermore, germline mutations, as well as single nucle- 
otide polymorphisms (SNPs) in ultraconserved ncRNAs, were 
found to occur more frequently in patients with colon cancer and 
chronic leukemia than in the general population (Wojcik et al. 
2010). 

The rs6983267 SNP, mapping to the 8q24.21 chromosomal 
region, has been consistently associated with an increased risk of 
colorectal cancer (CRC) (Haiman et al. 2007): The G allele was as- 
sociated with greater predisposition to CRC than the T allele (odds 
ratios of 1.27 and 1.47 for heterozygotes and homozygotes, re- 
spectively; P = 1.27 X 10" 14 ) (Tomlinson et al. 2007). The increased 
cancer risk from this SNP variant was also observed in other cancer 
types, including prostate, ovarian, and inflammatory breast cancer 
(Ghoussaini et al. 2008; Bertucci et al. 2012). Despite the consistent 
association between rs6983267 and cancer risk, the underlying 
molecular and cellular mechanisms remain largely unknown. The 
genomic region spanning rs6983267 was found to contain DNA 
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enhancer elements (Pomerantz et al. 2009; Tuupanen et al. 2009), 
and the allelic variants were shown to confer different binding 
affinity to TCF7L2 (transcription factor 7-like 2 [T-cell specific, 
HMG-box]) ; a transcription factor that, together with CTNNB1, 
plays a central role in the transcriptional activation of WNT target 
genes. In view of the well-established role of constitutive WNT 
signaling activation in CRC, these findings suggest that rs6983267 
itself resides in a functional element that directly participates in 
colon cancer pathogenesis. 

On the base of the localization of rs6983267 within a highly 
conserved region of the genome (http://genome.ucsc.edu) (Sotelo 
et al. 2010) and our previous report of transcription in UCRs (Calin 
et al. 2007), we hypothesized that an as-yet-unidentified, highly 
conserved ncRNA could be transcribed from this genomic region. 



The association between rs6983267 and CRC risk further sug- 
gested that this ncRNA is likely to be aberrantly expressed in 
colon cancer and to play a role in CRC carcinogenesis through 
WNT signaling. 

Results 

CCAT2, a novel IncRNA transcript maps to the highly conserved 
8q24.21 region encompassing rs6983267 and is overexpressed 
in microsatellite-stable CRC samples 

The 8q24.21 genomic region encompasses a highly conserved 
segment with homologies found in 28 species-conserved tracks 
(University of California, Santa Cruz Genome Browser; http:// 
genome.ucsc.edu/) (Fig. 1A). We designed primers spanning the 
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Figure 1. CCAT2, a novel IncRNA spanning rs6983267, is overexpressed in colorectal cancer samples. (A) Genomic features of the 8q24.21 region 
spanning SNP rs6983267 and the genomic location of CCAT2. Numbers in parentheses show the genomic distance relative to rs6983267 (image not to 
scale). (B) CCAT2 expression in CRC patient samples. CCAT2 expression was quantified using qRT-PCR, and data are presented as box-whisker plots 
showing the five statistics (lower whisker is 5% minimum, lower box part is the 25th percentile, solid line in box represents the median, upper box part is 
75th percentile, and upper whisker is 95% maximum). (C) In situ hybridization of CCAT2 and RNU6-6P (U6) (as normalizer) in CRC patient samples. 



Genome Research 1447 



www.genome.org 



Ling et al. 



rs6983267 SNP and found that this region was transcribed at high 
levels in human pancreas, lung, prostate, and testis tissues (data 
not shown). Subsequently, we cloned by RACE (rapid amplifica- 
tion of cDNA ends) a 340-bp nonspliced transcript from bone 
marrow cDNA (Fig. 1A). To exclude the possibility that this tran- 
script was part of long intergenic noncoding RNAs (lincRNAs) 
mapped in this region (RP11-382A18.2 and RP11-382A18.1), we 
performed polymerase chain reaction (PCR) amplification using 
lincRNA-specific primers combined with primers from within 
the 340-bp transcript. We did not observe any amplified product in 
the COLO320 colon cancer cell line that expresses the highest 
levels of the newly identified transcript, suggesting that it is dis- 
tinct from the two lincRNAs. We further cloned an intronless 
630-bp transcript from mouse testis cDNA. Notably, the sequence 
overlapping between the transcripts cloned from the human 
and mouse tissues shared 93% homology and encompassed the 
rs6983267 SNP. 

Because a previous study using a tiling assay did not find any 
transcriptional activity in this region (Pomerantz et al. 2009), we 
sought other independent data to support the existence of this 
newly identified RNA transcript. By searching the Long RNA-seq 
data from ENCODE/CSHL (http://genome.ucsc.edu/; Djebali et al. 
2012), we found that the rs6983267 region is actively transcribed 
in the genomic sense orientation in GM12878 lymphoblastoid 
cells (Supplemental Fig. SI). We next determined whether this 
transcript has protein-coding potential. The transcript's non- 
coding nature was suggested by the negative results of the in vitro 
translation assay, along with the lack of consistent open reading 
frames (ORFs) by ORF Finder (http://www.ncbi.nlm.nih.gov/gorf/ 
gorf.html), negative score by PhyloCSF (-271.5366, meaning that 
CCAT2 is io 27 - 15366 times more likely to be a noncoding sequence 
than a coding one), a comparative genomic method differentiating 
coding and noncoding RNA (Lin et al. 2011), lack of coding po- 
tential determined by coding potential assessment tool (CPAT; 
http://lilab.research.bcm.edu/cpat/) (Wang et al. 2013), and neg- 
ative data from the small-peptide database (Pepbank; http:// 
pepbank.mgh.harvard.edu/search/blast) (Supplemental Fig. S2A). 
We further confirmed the existence of this novel RNA by using 
Northern blot analysis on a panel of colon cancer cell lines probed 
by sense-specific oligonucleotides spanning rs6983267: A tran- 
script of —400 bp in genomic sense orientation (centromere to 
telomere) was detected with the highest expression in COLO320 
cells (Supplemental Fig. S2B). Consistent with this observation, 
Southern blot analysis revealed that among analyzed samples, 
COLO320 cells had the highest degree of genomic amplification of 
the 8q24 region (data not shown). By taking advantage of a cohort 
of nine paired colon specimens, we determined by quantitative 
real-time PCR (qRT-PCR) analysis that the newly identified tran- 
script was expressed at much higher levels in tumor tissue than in 
the adjacent normal mucosa, and confirmed that the transcription 
occurs in the genomic sense orientation (Supplemental Fig. S2C). 
We named this novel gene transcript colon cancer-associated 
transcript 2 (CCAT2). 

We further measured CCAT2 expression by qRT-PCR in 215 
CRC and 94 paired non-neoplastic mucosal specimens obtained 
from patients from four different geographical regions (i.e., Italy, 
Japan, Australia, and North America). We identified significantly 
higher CCAT2 expression in the CRC tissue as compared with the 
adjacent mucosae (Fig. IB). Moreover, microsatellite-stable (MSS) 
cancers had higher CCAT2 expression (approximately 10 times) 
than microsatellite-unstable (MSI-H) tumors and adjacent normal 
colon mucosae (Fig. IB; Supplemental Fig. S2C). To visualize 



CCAT2 expression, we performed in situ hybridization by using 
a locked nucleic acid probe designed against CCAT2 RNA. In 
agreement with the qRT-PCR findings (Fig. IB), the epithelial ex- 
pression pattern observed had stronger staining in MSS than in 
MSI-H cancers and normal mucosae (Fig. 1C). We did not observe 
a significant association between CCAT2 expression and genomic 
DNA amplification in an additional set of CRC samples from Japan 
(n = 64, P = 0.46) (Supplemental Fig. S2D), suggesting that CCAT2 
expression is, at least partially, independent of the genomic DNA 
copy number alterations in this chromosomal region. 

CCAT2 promotes cancer growth and metastasis 

To test the hypothesis that CCAT2 plays an oncogenic role in CRC, 
we cloned CCAT2 into a retroviral expression vector and trans- 
duced it into HCT116 colon cancer cells. This is an MSI-H cell line 
(MSH6 mutant) with a near-diploid karyotype (Lengauer et al. 
1997) that expresses low levels of CCAT2 (when compared with 
COLO320) (Supplemental Fig. S2B). Retrovirus transfection of the 
CCAT2 construct caused a 630-times increase in CCAT2 expression 
compared with the cells transfected with empty vector (Supple- 
mental Fig. S3 A). For comparison, the expression difference between 
high-expresser COLO320 cells and low-expresser HCT116 cells was 
about 1500 times (Supplemental Fig. S3 A). The CG4T2-transduced 
HCT116 cells did not affect cell proliferation in two-dimensional 
(2D) culture conditions (Supplemental Fig. S3B), but showed a 
growth advantage under low-adherence conditions and increased 
colony-forming potential (Supplemental Fig. S3C,D). Subcutaneous 
transplantation of CG4T2-overexpressing HCT116 cells resulted in 
larger xenograft tumors in Swiss nu-nu/Ncr nude mice when com- 
pared with empty vector-transduced cells (Fig. 2A). This observed in 
vivo tumor promotion effect of CCAT2 was independently con- 
firmed by a xenograft study using pcDNA construct-established 
CG4T2-overexpressing HCT116 cells (data not shown). 

We next studied whether CCAT2 is involved in promoting the 
metastatic phenotype in colon cancer cells. The in vitro migration 
assay showed a twofold increase in the migration of CCAT2- 
transduced HCT116 cells (P = 0.001) (Fig. 2B). To validate the role 
of CCAT2 in the metastatic phenotype, we used KM12SM cells, 
a CRC cell line established from the spontaneous liver metastasis of 
KM12C cells (Camps et al. 2004). Knockdown of CCAT2 by siRNA, 
which reduced the CCAT2 expression by 70%, significantly re- 
duced the invasive ability of KM12SM cells (that express high 
levels of CCAT2), suggesting the involvement of CCAT2 in me- 
tastasis (Fig. 2C; Supplemental Fig. S4). More importantly, spleen 
injection of CG4T2-overexpressing HCT116 cells into Swiss nu-nu/ 
Ncr nude mice resulted in a higher incidence of liver metastasis (six 
of nine mice vs. one of five mice) as well as a greater number of 
metastatic tumors (P = 0.025) than in the control group (Fig. 2D). 
These studies suggest that modulation of CCAT2 expression levels 
changes the metastatic capacity of cancer cells. In addition, in the 
Italian cohort of CRC samples (which was the larger available set), 
we found significantly higher CCAT2 expression levels in primary 
CRC tumors from patients with metastasis (TNM category Ml) 
than in those without metastasis (M0; P = 0.003) (Fig. 2E). 

To further expand on this issue and in view of the previous 
report on the correlation between rs6983267 and breast cancer 
aggressiveness (Bertucci et al. 2012), we measured CCAT2 RNA 
expression levels in 129 lymph node-positive breast cancer pa- 
tients who received an adjuvant cyclophosphamide, methotrexate, 
and 5-fluorouracil combination regimen. High CCAT2 RNA levels 
were associated with shorter metastasis-free survival when analyzed 
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Figure 2. CCAT2 promotes tumor growth and metastasis. (A) CCAT2 increased subcutaneous tumor formation in a mouse xenograft model. Com- 
parison was made between the empty vector and CCAT2 groups at the indicated weekly time points using the t-test. (B) HCT1 1 6 cells transduced with 
CCAT2 showed significantly higher migration ability, as measured by using a migration chamber. Data are presented as mean ± SEM from three in- 
dependent experiments. (C) Knockdown of CCAT2 reduces invasion ability of KM1 2SM colon cancer cells. After treatment with CCAT2 siRNA (50 nM) for 
24 h, cells were seeded onto an invasion assay chamber for 48 h, and invaded cells were stained and counted. (D) CCA 12 enhances liver metastasis in mice 
that were incubated by intrasplenic injection with HCT1 16 cells. A representative image of liver metastasis from the CCAT2 group is shown, (f) CCAT2 
expression levels in Italian primary CRC samples from patients with metastasis (Ml) were higher than those from without metastasis (M0). Comparison 
was made using the Mann-Whitney test. Data are presented as box-whisker plots. (F) Breast cancer patients with high CCAT2 had shorter metastasis-free 
survival. Kaplan-Meier analysis as a function of CCAT2 levels in 129 lymph node-positive breast cancer patients who received adjuvant combination 
chemotherapy with cyclophosphamide, methotrexate, and 5-fluorouracil. CCAT2 high level to normalizer, >0.0045; CCAT2 low level, <0.0045. 



as both a continuous variable (Supplemental Table SI) and as a di- 
chotomized variable using the median level as a cutoff point (Fig. 2F). 

CCAT2 induces chromosomal instability 

Because chromosomal instability (CIN) represents a main feature 
of MSS CRCs, and in view of the high CCAT2 expression levels 



among MSS colon cancers (Fig. 1B,C), we hypothesized that 
CCAT2 may underlie CIN development. We generated stable 
HCT116 clones overexpressing CCAT2 (OC1 and OC2) or with 
basal CCAT2 expression (El and E2) by using the pcDNA expres- 
sion vector (Fig. 3 A). Genomic instability analysis showed that the 
percentage of cells with normal metaphases was markedly lower 
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Figure 3. CCAT2 induces chromosomal instability in HCT116 cells. (A) Increased chromosomal abnormalities in CG472-overexpressing clones, as 
identified by genomic instability analysis. (B) Example of spectral karyotypes of HCT1 16 clones revealed a change from the original diploid (El) to the 
tetraploid karyotype in the OC2 clone. (QD) CCAT2 induces chromosomal instability in an independent set of CG472-overexpressing clones. The number 
of chromosomal abnormalities in each clone was counted; data are expressed as mean ± SD. (f ) Abnormal centrosomes in CCA 72-overexpressing clones 
OC1 and OC2 by immunostaining. Beta-tubulin (red); CREST/kinetochore (green). 



in OC (59.5% in OC1 and 31.4% in OC2) than in E (84.2% in El 
and 94.3% in E2) clones (Fig. 3A). In the same analysis, CCAT2 
dramatically increased the percentage of polyploid cells from 5.3% 



(El) and 2.8% (E2) to 33.3% (OC1) and 60.0% (OC2) (Fig. 3A). We 
detected near-tetraploid status in clone OC2 by spectral karyotype 
analysis (Fig. 3B) and observed a doubling of nuclear size and DNA 
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content by DAPI staining and flow cytometric analysis (Supple- 
mental Fig. S5A,B) . The CIN-inducing ability by CCAT2 was further 
validated by a second batch of eight overexpressing clones that 
exhibited numerical and structural chromosomal changes (Fig. 
3C,D; Supplemental Fig. S5C). CIN was also observed in HCT116 
cells transduced with CC4T2-containing retrovirus (Supplemental 
Fig. S5D). Stable knockdown of CCAT2 expression in COLO320 
cells partially reversed the abnormal metaphase observed in 
COLO320 cells (Supplemental Fig. S5E). To further identify 
the mechanism by which CCAT2 overexpression may underlie 
CIN, we evaluated spindle and CREST/kinetochore structure by 
immunocytochemistry analysis of the CCAT2 clones (El, E2, OC1, 
and OC2) with antibodies directed against beta-tubulin and the 
centromere. Normal numbers of centrosomes were observed in 
both El and E2 with spindles separating chromosomes from two 
centromeres in the nucleus (Fig. 3E). However, we detected aberrant 
numbers of centrosomes pulling chromosomes from three or more 
different points in the CG4T2-overexpressing clones (Fig. 3E). The 
observed defects in centrosomes are likely to result in abnormal 
chromosomal breakage and fusion, and eventually aneuploidy. 

Because CIN scores were already available from 218 breast 
cancer patients with lymph node-negative disease (Smid et al. 
2011), we assessed the CCAT2 expression levels in these tumor 
samples. We identified a significant positive correlation between 
CCAT2 level and CIN score (Spearman correlation: Rs = 0.17, 
P = 0.012). 

CCAT2 regulates MYC transcription 

To explore the molecular mechanism underlying CCAT2's onco- 
genic potential, we characterized the expression of other tran- 
scripts located in the same region: MYC and exons 6 and 8 of 
DQ5 15897 (but not POU5F1B) showed expression patterns similar 
to that of CCAT2, although with smaller differences among MSI-H, 
and MSS cancers and normal colorectal mucosa (Fig. 4A; Supple- 
mental Fig. S6A). In view of these associative data and of previous 
reports describing a long-range interaction between rs6983267 
and MYC (Pomerantz et al. 2009; Sotelo et al. 2010), we asked 
whether CCAT2 plays a role in regulating MYC expression. Among 
the HCT116 clones, we found higher MYC expression at both the 
RNA and protein levels in cells overexpressing CCAT2 (3.4-fold and 
2.5-fold increases in OC1 and OC2, respectively; P = 0.030 and P = 
0.038, respectively) (Fig. 4B). Higher MYC expression was also 
found in three additional independent HCT116 clones over- 
expressing CCAT2 (Supplemental Fig. S6B). Expression of down- 
stream MYC protein-coding gene targets such as BAX (Mitchell 
et al. 2000), CDC25A (Galaktionov et al. 1996), and CDKN2A 
(Zindy et al. 1998), and the microRNA targets such as MIR17HG 
(O'Donnell et al. 2005) and miR-146a (Chang et al. 2008) were also 
changed accordingly (Fig. 4C; Supplemental Fig. S6C,D). Addi- 
tionally, the MIR17HG was also up-regulated in retrovirus-trans- 
fected cells (Supplemental Fig. S6E) and was significantly reduced 
by transient MYC down-regulation (Supplemental Fig. S6F), sug- 
gesting that these microRNAs are regulated by MYC in our ex- 
perimental systems. Conversely, transient knockdown of CCAT2 
reduced MYC expression (both RNA and protein) in CCAT2- 
overexpressing clones (Fig. 4D), suggesting that CCAT2 RNA, and 
not merely its genomic locus, is required for the maintenance of 
high MYC expression levels. In addition, sustained down-regulation 
of CCAT2 by shRNA retroviruses decreased MYC expression and 
MIR17HG expression in COLO320 cells in a CC4T2-dependent 
fashion (Fig. 4E; Supplemental Fig. S6G). This effect was further 



validated in Tet-on inducible COLO320 clones, where CCAT2 
expression was reduced by specific shRNAs in response to doxy- 
cycline (Fig. 4F). 

Next, we moved on to determine whether MYC and/or its 
targets are mediators of the above-observed CC4T2-driven en- 
hanced cell migration. In our experimental system, reducing MYC 
expression did not significantly decrease the migration of CCAT2- 
transduced cells (Supplemental Fig. S6H). However, consistent 
with the consensus that the MIR17HG increases metastasis (Bartley 
et al. 201 1; Zhang et al. 2012), knockdown of miR-1 7-5p or miR-20a 
in CCAT2-overexpressing cells led to a significant decrease in cell 
migration (Fig. 4G; Supplemental Fig. S6I). These findings suggest 
a not-yet-identified complex network underlying CG4T2-elicited 
metastatic phenotype. 

Mechanisms of CCATTs regulatory effect 

Because CCAT2's subcellular localization may provide additional 
clues about its functional role, we measured CCAT2 expression in 
nuclear and cytosolic fractions from CG4T2-expressing HCT116 
subclones and the COLO320 cell line by qRT-PCR. The differ- 
ential enrichments of GAPDH and RNU6-6P RNA were used as 
fractionation indicators (Supplemental Fig. S7). In these samples, 
we found a considerable increase in CCAT2 expression in the 
nucleus versus the cytosol, thus indicating that CCAT2 is mainly 
localized in the nucleus (Fig. 5 A). This was further confirmed by 
the in situ hybridization findings in CRC samples (Fig. 5B). 
Consistent with our findings, RNA-seq data showed that the 
transcript encompassing rs6983267 is exclusively expressed in 
the nucleus of GM12878 lymphoblastoid cells (Supplemental 
Fig. SI). We also investigated a potential effect of CCAT2 on MYC 
mRNA stability (Supplemental Fig. S8), as well as an RNA en- 
hancer effect (Supplemental Fig. S9; Supplemental Results; 
Orom et al. 2010), but could not find any significant variations 
related to CCAT2 levels. 

Aberrant WNT activation occurs in the majority of CRCs 
(Clevers 2006), and the risk allele of SNP rs6983267 has been 
shown to confer enhanced response to WNT signaling through 
TCF7L2 binding (Tuupanen et al. 2009). Since MYC is an estab- 
lished target of TCF7L2 as confirmed by the presence of TCF7L2- 
binding sites in its promoter as well as by functional studies (He 
et al. 1998), it is possible that CCAT2 affects MYC expression 
through the mediation of TCF7L2. Consistent with previous 
findings, we identified TCF7L2 binding to the MYC promoter re- 
gion and observed enhanced binding in the OC1 clone when 
compared with the El clone (Fig. 5C). Next, we sought to de- 
termine whether this was due to an increase in nuclear CTNNB1 or 
TCF7L2 accumulation. Using Western blot analysis, we observed 
a comparable level of total CTNNB1 expression levels with no in- 
crease in nuclear CTNNB1 between CG4T2-overexpressing clones 
and their control cells (Supplemental Fig. S10). Similarly, TCF7L2 
expression levels were comparable in all four clones (Supplemental 
Fig. S10). Notably, by using a TOP-Flash lucif erase assay, we ob- 
served a twofold increase in WNT reporter activity in OC1 and OC2 
clones with increased CCAT2 expression levels (Fig. 5D). Increased 
luciferase activity was also observed in an independently generated 
set of high-CG4r2-expressing HCT116 clones (Supplemental Fig. 
Sll). This result was further confirmed in HEK293 WNT reporter 
cells transiently transfected with CCAT2 expression vector (Fig. 
5E). Consistent with these observations, WNT reporter activity in 
COLO320 cells was effectively reduced by CCAT2 siRNA and also as 
expected by TCF7L2 siRNA (Supplemental Fig. S12). These find- 
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Figure 4. Regulation of MKC expression by CCAT2. (A) Increased CCAT2 and MKC expression in MSS CRC samples. Expression profile of MYC and CG4T2 
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expression in COLO320shCG472 stable clones. (F) Reduction of MVCin doxycycline-inducible shCCA 12 clones of COLO320. (G) Reduction of migration in 
CC/472-transduced HCT1 16 cells by anti-miR-1 7-5p and anti-miR-20a. Twenty-four hours after anti-miR treatment, cells were seeded onto a migration 
assay chamber for another 24 h, and migrated cells were stained and counted. (*) P < 0.05. 



ings suggest that CCAT2 augments TCF7L2 transcriptional activity 
without increasing the quantity of TCF7L2. 

We used RNA immunoprecipitation analysis to examine the 
physical interaction between CCAT2 and TCF7L2 by pulling 
down RNAs colocalized with TCF7L2 protein. We found an en- 
richment of CCAT2 RNA, but not GAPDH and the HOTAIR 
ncRNA by TCF7L2 antibody compared with the IgG control in 
the OC1 clone (Fig. 5F; Supplemental Fig. S13). To test whether 
endogenous CCAT2 binds to TCF7L2 protein, we analyzed the 
RNA-protein association in COLO320 cells that express high 
CCAT2 levels. Similar to the findings in the OC1 HCT116 clone, 
a significant enrichment of CCAT2 was observed in TCF7L2 pull- 



down RNA samples from COLO320 cells (Fig. 5G; Supplemental 
Fig. S13). 

Overall, these results suggest that the main mechanism 
through which CCAT2 increases MYC expression relies on activa- 
tion of WNT signaling by enhancing TCF7L2 transcriptional ac- 
tivity. Accordingly, the expression of specific WNT target genes 
such as CD44 (Wielenga et al. 1999) and VIM (Gilles et al. 2003) 
was increased in CG4T2-overexpressing clones (Fig. 5H,I). Con- 
versely, knockdown of CCAT2 expression caused a reduction in the 
expression level of VIM in both KM12SM (with high endogenous 
CCAT2 expression) and COLO320 (with very high endogenous 
CCAT2 expression) cells (Supplemental Fig. SI 4). 
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CCAT2 responds to WNT signaling 

Previously, a TCF7L2-binding element was found to be located 
within the rs6983267 genomic span (Pomerantz et al. 2009; 
Tuupanen et al. 2009), which suggested that CCAT2 expression 
levels might be regulated by this transcription factor. Therefore, 



we treated three colon cancer cell lines (COLO320, HCT116, and 
RKO) with lithium chloride (LiCl) in order to further activate ca- 
nonical WNT signaling through inhibition of glycogen synthesis 
kinase 3 beta (GSK3B) (Tuupanen et al. 2009). The effect of LiCl was 
monitored by TOP-Flash luciferase activity and by AXIN2 expres- 
sion (Jho et al. 2002). In all these cell lines, we observed an increase 
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in CCAT2 expression after LiCl treatment (Fig. 6A). TCF7L2 
knockdown using two different siRNAs impaired the LiCl-induced 
CCAT2 expression increase in HCT116 cells (Fig. 6B; Supplemental 
Fig. S15), suggesting that the effect on WNT signaling was TCF7L2 
dependent. In the CG4T2-overexpressing clones, TCF7L2 siRNA 
consistently reduced CCAT2 expression, either with or without LiCl 



stimulation, indicating that TCF7L2 is necessary for the mainte- 
nance of high CCAT2 expression levels (Fig. 6C). The endogenous 
expression of CCAT2 was also significantly down-regulated by 
TCF7L2 siRNA in KM12SM and COLO320 cells (Fig. 6D,E). These 
findings suggest the existence of a complex feedback loop of 
CCAT2 and WNT signaling. 
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rs6983267 alleles affect CCAT2 expression and function 

The discovery of CCAT2 and its role in CRC suggests that it may 
underlie the mechanism of the rs6983267 SNP-conferred cancer 
risk. We first test if rs6983267 status affects CCAT2 expression in 
patient samples. We genotyped the rs6983267 SNP in previously 
profiled MSS samples (Italian, Japan, and Australian cohorts) (Fig. 
IB) and separated them into GG, GT, and TT groups. We did not 
observe a significant difference (P = 0.450) in CCAT2 expression 
levels between the GG (n = 52) and TT (n = 45) groups (Supple- 
mental Fig. SI 6). However, further separation of the groups ac- 
cording to the individual cohort revealed that the difference (P = 
0.009) is significant in the Italian cohort, which has the largest 
sample size (Fig. 6F). Inconsistently, there was no significant dif- 
ference in CCAT2 expression between the GG and TT alleles in 93 
CRC patients from the United Kingdom (data not shown). These 
data suggest that the effect of the rs6983267 allele on CCAT2 
expression levels may be very subtle, and for detection better- 
controlled experimental designs are needed. 

To address this question, we took advantage of CRC cell lines 
(DLD1 and KM12SM cells) with a heterogeneous rs6983267 ge- 
notype, to measure the amount of CCAT2 transcript produced 
from different alleles. We amplified the region spanning the SNP 
using genomic DNA or RNA as template and sequenced the PCR 
products by Pyrosequencing, a method that allows for accurate 
and quantitative analysis of DNA sequence (Fakhrai-Rad et al. 
2002). In both DLD1 and KM12SM cells, the percentage of G allele 
is slightly (—6%), but significantly, higher in RNA samples than 
the corresponding genomic DNA samples (Fig. 6G). These data 
prompted us to determine the allele ratio in genomic DNA and 
RNA of CRC samples with GT genotype. As shown in Figure 6H, 
three out of four samples showed consistent findings — that is, 
significantly higher percentage (7%-15%) of G allele in RNA than 
in DNA samples. These data suggest that the G allele of rs6983267 
produces more CCAT2 transcripts than the T allele. 

We next determined whether CCAT2 transcripts containing 
G or T have different biological functions. We mutated the SNP site 
of the retrovirus CCAT2 construct from G to T and transfected 
HCT116 cells with CCAT2-G or CCAT2-T retroviruses. Although 
we observed higher MYC expression (both RNA and protein) in 
CCAT2-G- than CC4T2-T-transfected cells (Supplemental Fig. 
S17A,B), the difference becomes insignificant after normalizing to 
CCAT2 expression levels (Supplemental Fig. SI 7C). To test whether 
the rs6983267 allele affects CCAT2's regulation of MYC in CRC 
patients, we analyzed CCAT2 and MYC expression correlation in 
previously genotyped MSS CRC samples. A significant positive 
correlation between CCAT2 and MYC expression was observed in 
GG samples (n = 52, coefficient = 0.43; P = 0.001), but not in TT 
samples (n = 45, coefficient = 0.26; P = 0.078) of CRCs (Supple- 
mental Fig. S18). Furthermore, by using the Vienna RNA Secondary 
Structure Package (http://www.tbi.univie.ac.at/~ivo/RNA/), we 
identified that the CCAT2 G and T alleles have distinct con- 
formations (Supplemental Fig. SI 9). Although further studies are 
needed to verify the different biological functions of the rs6983267 
G and T alleles, our data indicate that the specific alleles may also 
differentially affect MYC expression. 

Discussion 

We report the identification of CCAT2, a novel IncRNA encom- 
passing the cancer-related rs6983267 SNP and located in a highly 
conserved genomic region enriched for markers of regulatory 



elements, such as H3K4mel, p300, and H3K27ac (Pomerantz et al. 
2009). This IncRNA is expressed at high levels in MSS CRC tumor 
tissues and influences MYC levels, but has very low-level expres- 
sion in normal colon tissues. This could explain how this new gene 
has escaped identification, until now, after genome-wide associa- 
tion studies identified the association between the SNP rs6983267 
and cancer risk, as well as why tiling arrays did not identify it. A 
recent study showed that removal of the region encompassing the 
rs6983267 SNP confers resistance to intestinal tumors induced by 
the APCmin mutation in mice (Sur et al. 2012). Because we also 
cloned a novel transcript encompassing this SNP region from 
mouse tissue, it can be envisioned that deletion of the region also 
leads to removal of the CCAT2 transcript. Thus, this study lends 
further support to the oncogenic function of CCAT2 in CRC 
pathogenesis. In addition, the findings of decreased MYC expres- 
sion and reduced TCF7L2 binding upstream of MYC in knockout 
mice from the same study are also consistent with our findings that 
CCAT2 regulates MYC transcriptional activity. 

CG4T2-enhanced invasion and metastasis appear to occur 
through the MYC-regulated miRNAs miR-17-5p and miR-20a. The 
role of MYC in metastasis has been controversial, and a recent pub- 
lication shows that, in breast cancer, MYC suppresses metastasis 
by direct transcriptional silencing of integrins (Liu et al. 2012). Of 
interest, miR-20a was identified as being overexpressed in the 
transition from colon mucosa to early adenoma, and was predicted 
to target multiple WNT pathway genes (Bartley et al. 2011). The 
observation according to which CG4T2-enhanced invasion is 
mediated by MYC-regulated miRNAs and also possibly by other 
pro-metastatic targets such as CDC25A (Feng et al. 2011), rather 
than by MYC itself, suggests a complex regulatory network that 
needs to be further explored in the future. This is also in agreement 
with the recent observation by The Cancer Genome Atlas (TCGA) 
Consortium that MFC-driven transcriptional activation and re- 
pression play important roles in colon cancer (The Cancer Genome 
Atlas Network 2012). It is also possible that the effect of CCAT2 
on metastasis is mediated through its general activation of WNT 
signaling (Nguyen et al. 2009; Yu et al. 2012) or another unknown 
mechanism. 

The TOP-Flash luciferase assay suggests increased WNT ac- 
tivity in CG4T2-overexpressing cells. Therefore, we postulated that 
the increased MYC transcription could result from increased WNT 
activity, because MYC itself represents a well-established WNT 
target (He et al. 1998). Recent studies have identified RNA-protein 
interaction as one of the main regulatory mechanisms of IncRNAs 
(Rinn and Chang 2012). Coimmunoprecipitation of CCAT2 (RNA) 
and TCF7L2 (protein) suggests that CCAT2 may broadly regulate 
WNT target genes by binding to TCF7L2 and modulating its 
transcriptional activity. Various mechanisms can be envisaged to 
explain how the CG4T2/TCF7L2 association can enhance WNT 
activity. We speculate that CCAT2 may affect TCF7L2-driven gene 
transcription through modulating association of TCF7L2 with its 
partners, which include the negative regulators Groucho/TLE and 
CTBP1 and the activator CTNNB1 (Clevers 2006), or changing 
TCF7L2 protein function by bridging TCF7L2 with other proteins 
in the transcription complexes. Alternatively, the association of 
CCAT2 with TCF7L2 may modify the protein structure and en- 
hance its function. Regarding the mechanism of MYC regulation, 
it is also possible that CCAT2 participates in the loop formation 
between the genomic loci of rs6983267 with MYC promoter 
(Pomerantz et al. 2009) and works together with the enhancer el- 
ement to activate MYC transcription. Notably, we also show that 
WNT signaling regulates CCAT2 expression, suggesting a feedback 
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loop between CCAT2 and WNT signaling as previously shown for 
other targets such as CDH1 (Heuberger and Birchmeier 2010) and 
AXIN2 (Lustig et al. 2002). 

We demonstrated CCAT2's ability to underlie chromosomal 
instability manifested by the loss or gain of large portions or whole 
chromosomes, eventually resulting in aneuploidy a characteristic 
trait of MSS CRCs. These data support a scenario in which, together 
with mutations in coding genes such as APC (Fodde et al. 2001; 
Kaplan et al. 2001) and BUB1 (Cahill et al. 1998), CCAT2 contrib- 
utes to CIN in colon cancer. This is likely to occur through CCAT2's 
effects on downstream mediators such as MYC and/ or other WNT 
target genes. The direct involvement of MYC in CIN has been 
previously demonstrated by showing that 
transient excess of MYC activity could 
elicit genomic instability and carcino- 
genesis (Felsher and Bishop 1999). In- 
versely, Myc deletion rescues APC defi- 
ciency in the mouse intestine (Sansom 
et al. 2007). Thus, it is plausible that MYC 
acts as a mediator of CC4T2-induced CIN 
phenotype. Furthermore, the MIR17HG 
miRNAs, known to be MYC regulated, 
can also affect CIN as evidenced by dif- 
fuse large B-cell lymphoma that has high 
levels of these miRNAs and nonrandom 
chromosomal abnormalities (Li et al. 
2009). Alternatively, the CIN-inducing ef- 
fect can result from the general activation 
of WNT signaling by CCAT2. This hy- 
pothesis is supported by previous findings 
that CTNNBl/TCF7L2-mediated transcrip- 
tion (Aoki et al. 2007) or ectopic expres- 
sion of the WNT target gene conductin 
(AXIN2) (Hadjihannas et al. 2006) directly 
induces CIN. However, it should be noted 
that although CIN can be the underlying 
mechanism for CG4T2-promoted tumor 
growth and metastasis, mechanistic con- 
nections of these phenotypes are not 
established in this study. 

Finally, our Pyrosequencing data on 
heterogeneous cell line models and CRC 
samples suggest that rs6983267 status af- 
fects the CCAT2 expression, providing an 
additional mechanism linking the risk al- 
lele of rs6983267 with higher CRC risk. 
Although we demonstrated that WNT 
signaling regulates CCAT2 expression, it 
remains to be determined if this accounts 
for the SNP-caused differential expression 
of CCAT2. In addition, in CRC patients 
with GG (but not TT) genotype, CCAT2 
shows significant association with MYC 
expression levels, suggesting that the SNP 
status may also affect CCAT2 function as 
an RNA transcript. Although most of the 
previous studies failed to identify a corre- 
lation between this SNP and MYC expres- 
sion, recent research found higher MYC 
expression in CRC samples containing GG 
alleles, probably due to the improved pu- 
rity of the cancer tissues owing to the use 



of laser microdissection (Takatsuno et al. 2012). Our findings on 
SNP-induced differential CCAT2 expression and function, together 
with the DNA enhancer element (Pomerantz et al. 2009; Tuupanen 
et al. 2009), provide an explanation on the cancer risk conferred by 
the rs6983267 risk allele (Fig. 7). 

The discovery of CCAT2 introduces a novel mechanism for 
the involvement of ncRNAs in WNT signaling and colon cancer 
pathogenesis. The recent findings showing that the CIN phe- 
notype is a predictive marker for survival and may be used to 
select high-risk patients with stages II and III CRC (Watanabe 
et al. 2012), combined with the specific involvement of CCAT2 
in MSS CRCs indicate that CCAT2 has the potential to become 
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Figure 7. A model of CCAT2 locus involvement in CRC. (Upper panel) An ~335-kb DNA loop brings 
the rs6983267 genomic region close to the MYC locus, and this physical association may contribute to 
the enhancer function of the SNP-containing region on MYC transcription (Pomerantz et al. 2009). 
(Lower panel) The enhancer region is transcribed into a long noncoding RNA (CCAT2), and the SNP 
status affects CCAT2 expression by an as-yet-unknown mechanism. The CCAT2 transcript up-regulates 
WNT activity and increases expression levels of WNT target genes (including MYC). This regulation by 
CCAT2, possibly through its physical interaction with TCF7L2, may lead to genomic instability and 
promote cell growth. We demonstrated that MYC-regulated miR-1 7-5p and miR-20a participate in the 
CCAT2 -enhanced cell invasion and speculate that other mechanisms, such as MYC-related mechanisms 
(CDC25A) or enhanced WNT signaling (VIM and CD44), may exist to coordinate the metastatic phe- 
notype elicited by CCAT2. Finally, we demonstrated that CCAT2 expression is regulated by transcrip- 
tional factors TCF7L2, indicating a positive feedback loop between CCAT2 and WNT signaling. Our 
findings provide an additional explanation on the SNP-conferred CRC risk. (Black) Demonstrated; (gray) 
hypothesized interactions. 
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a useful diagnostic and/or prognostic marker, although further 
validation in larger cohorts and by multiple independent studies 
is necessary. 

Methods 

Patient samples 

One hundred and ninety-one colorectal cancer samples and 70 
non-neoplastic mucosa samples were used in this study. The tumor 
and control samples were obtained from three different sources: an 
Italian cohort from the University of Ferrara (CRC = 81 and normal 
colon mucosa = 25), a Japanese cohort from the University of 
Kyushu (CRC = 45 and paired normal colon mucosa = 45), an 
Australian cohort from Princess Alexandra Hospital (CRC = 65), 
and a C-CFR cohort from Northern American colon cancer patients 
with MSS, CIMP-negative colon cancer at three participating 
centers of the Colorectal Cancer Family Registry (Mayo Clinic, 
Rochester, Minnesota, USA; Mount Sinai Hospital, Ontario, Can- 
ada; and Cleveland Clinic, Cleveland, Ohio, USA) (CRC = 24 and 
paired normal colon mucosa = 24). Tissue samples were obtained 
from fresh surgical specimens frozen in liquid nitrogen and stored 
at -80°C. All the samples were obtained with the patients' in- 
formed consent and were histologically confirmed. 

Breast tumors were selected from the tumor bank at the 
Erasmus Medical Center (Rotterdam, The Netherlands). The frozen 
tumor samples were collected from female patients with breast 
cancer who entered the clinic from 1979 to 1998 and for whom 
detailed clinical follow-up data and tumor RNA for CCAT2 analysis 
were available (Sieuwerts et al. 2007; Smid et al. 2011). The protocol 
for studying biological markers associated with disease outcome was 
approved by the medical ethics committee of the Erasmus Medical 
Center (MEC 02.953). 

The British CRC samples were derived from a set of CRC pa- 
tients recruited as part of the CORGI study who had known geno- 
types at the 8q24 tagSNP (Tomlinson et al. 2007). 

RNA isolation, cDNA synthesis, and quantification of specific 
RNA species 

RNA was extracted using the TRIzol protocol (Invitrogen) and 
DNase-digested (Ambion); the quality of the RNA was assessed 
using agarose gel electrophoresis. cDNA was synthesized using the 
Superscript III cDNA kit (Invitrogen), and diluted cDNA was used 
for RT-PCR analysis using iQ SYBR Green Supermix (Bio-Rad) 
with the appropriate primers (Supplemental Table S2). The 2~ ACt 
method was used to calculate the relative abundance of RNA 
genes compared with GAPDH expression. qRT-PCR analysis for 
miRNAs was performed with the TaqMan MicroRNA Assay Kit 
(Applied Biosystems). Primers and probes for miR-17, miR-20a, 
miR-146a, let-7a, and RNU6-6P snRNA were purchased from Ap- 
plied Biosystems. The 2~ ACt method was used to calculate the 
relative abundance of miRNA genes compared with RNU6-6P 
expression. 

Cell lines 

COLO320DM, HCT116, RKO, and HEK293 cells were obtained 
from the American Type Culture Collection and validated by the 
Characterized Cell Line Core at The University of Texas MD 
Anderson Cancer Center using STR DNA fingerprinting. 

Stable cell line generation 

Stable HCT116 cells for CCAT2 overexpression experiments were 
produced by pMX retrovirus or pcDNA3.1 expression vector 



transfection. For the pcDNA3.1 single clones, we screened CCAT2 
expression in the HCT116 cells transfected with CCAT2 construct 
and randomly chose two clones with high CCAT2 expression (OC1 
and OC2), and one clone with basal level of CCAT2 expression 
(E2). El was originated from the HCT116 cells transfected with the 
empty pcDNA3.1 vector. 

COLO320 stable cells for CCAT2 knockdown experiments were 
generated using pRS-Retro-GFP/neo vector retrovirus (OligoEngine) 
or the Knockout Single Vector Inducible RNAi System (Clontech). 

Virus production 

The CG4T2-containing genomic region was amplified with a 
primer combination (CCAT2 F2 and R2) from HCT116 genomic 
DNA with Pfu polymerase (Invitrogen). After sequencing of the 
PCR product, we cloned it into the pMX plasmid. Then we pro- 
duced CCAT2-containing retrovirus in 293 GP2 cell lines and used 
virus-containing supernatant to infect HCT116 cells. After in- 
fection, HCT116 cells were grown in complete media containing 
puromycin (1 |xg/mL). The shRNAs against CCAT2 (Supplemental 
Table S2) were inserted into the pRS-Retro-GFP/neo vector ac- 
cording to the manufacturer's protocol (OligoEngine). 

Plasmid production 

The sequence used for retrovirus production was also cloned into 
a pcDNA3.1+ vector. The pcDNA CCAT2 plasmid was linearized 
with Bglll and transfected into HCT116 cells using Lipofectamine. 
Selection with G418 (0.5 mg/mL) was carried out until single col- 
onies were generated. The tet-on inducible shCG4T2 clones (Sup- 
plemental Table S2) were generated using the Knockout Single 
Vector Inducible RNAi System according to the manufacturer's 
instructions. 

CCAT2 in situ hybridization 

The frozen tissue slides of two MSS CRCs, two MSI-H CRCs, and 
two normal mucosae were incubated with a double-DIG-labeled 
CCAT2 probe or control RNU6-6P snRNA probe (Exiqon) and 
detected with a polyclonal anti-DIG antibody and alkaline 
phosphatase-conjugated secondary antibody (Ventana) using 
NBT-BCIP as the substrate. The signal intensities of CCAT2 and 
RNU6-6P expression were quantified by using the intensity mea- 
surement tools of the Image-Pro Plus software package (Media 
Cybernetics). 

In vivo tumorigenic assay 

Swiss nu-nu/Ncr nude mice (spontaneous mutant T-cell-deficient 
mice) were obtained from the Animal Production Area of the De- 
partment of Experimental Radiation Oncology at MD Anderson 
Cancer Center. The mice were maintained under specific pathogen- 
free conditions and used in accordance with institutional guide- 
lines. HCT116 cells (1 X 10 6 ) were injected subcutaneously into 
the right flanks of the mice, and tumor size was measured weekly 
for 4 wk. 

In vivo metastasis assay 

A midline incision was made on the left side of the flank, and the 
spleen was exteriorized. HCT116 cells (1 X 10 6 cells) were injected 
into the spleen, after which the wound was closed with surgical 
metal clips. The mice were sacrificed after 8 wk, and their spleens 
and livers were removed and examined for tumor metastases on 
the surface of the liver. The liver specimens were formalin fixed 
and paraffin embedded for histological analysis. 
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In vitro migration assays 

We used two assays: In the first, HCT116 cells (200,000 cells in 
200 (jlL of serum-free media) were seeded into a Transwell insert in 
a 24-well format (8 |xm; BD Biosciences). To the bottom part of 
the well, 800 jjlL of medium with serum was added. Twenty-four 
hours after seeding, cells were fixed with paraformaldehyde and 
stained with crystal violet/methanol solution. Cells at the top of 
the Transwell were removed using a cotton swab, and photos of 
the Transwells were taken using Geldoc (Bio-Rad). The intensity 
of the crystal violet staining in each Transwell was evaluated us- 
ing Photoshop software (Adobe Systems) and compared among 
the different treatments. 

The second type of assay, a cell migration assay, was per- 
formed according to the protocol previously described (Spannuth 
et al. 2009). Cells were suspended in serum-free media (65,000 
cells/insert) and seeded onto the gelatin-coated inserts. The cells 
that migrated to the bottom of the wells were fixed, stained, and 
counted using a microscope. For each well, 10 different fields were 
counted, and the average number of cells was determined. Both 
types of assays were performed in triplicate experiments. 

SiRNA treatment 

Cells were seeded onto 24-well or six-well plates. After reaching 
50% confluence, cells were transfected with 50 nM siRNA by 
Lipofectamine 2000 (Life Technologies) for 48 h before collecting 
RNA or protein for analysis. 

Genomic instability analysis 

Cells were exposed to Colcemid (0.04 |xg/mL) for 25 min at 37°C 
and to hypotonic treatment (0.075 M KC1) for 20 min at room 
temperature, and then were fixed in a methanol and acetic acid 
mixture (3:1 by volume) for 15 min and washed three times in the 
fixative. The slides were air-dried, stained in 4% Giemsa, and coded 
for the blind analysis. Later, the slides were decoded for the eval- 
uation of results. Slides were analyzed for several parameters, in- 
cluding chromosome aberrations (as evidenced by both chromo- 
some- and chromatid-type breaks), fragments, tetraploidy, fusions, 
and formation of tri-radials. 

Spectral karyotyping 

Spectral karyotyping was performed according to the manufac- 
turer's protocol using Human Paint probes (Applied Spectral Im- 
aging, ASI). Images were captured using a Nikon 80i microscope 
equipped with Spectral karyotyping software from ASI. 

CREST staining 

After fixation, cells were incubated for 1 h at room tempera- 
ture with human serum from scleroderma patients (kindly pro- 
vided by Maureen Mayes from the University of Texas-Houston 
Medical School) that produced, during the course of their dis- 
ease, anti-centromere antibodies used for staining. Staining was 
visualized using FITC-conjugated donkey anti-human antibody. 
Beta- tubulin was detected using Cy3 -conjugated antibody (Sigma- 
Aldrich). 

Subcellular fractionation 

The separation of nuclear and cytosolic fractions was performed 
using the PARIS Kit (Life Technologies) according to the manu- 
facturer's instructions. 



RNA immunoprecipitation 

We used the Magna RIP Kit (Millipore) according to the manu- 
facturer's instructions. Cells were prepared in RIP lysis buffer, and 
the RNA-protein complexes were immunoprecipitated using 
anti-TCF7L2 or normal goat IgG (control). RNA was purified 
using phenol:chloroform:isoamyl alcohol and subjected to re- 
verse transcription-PCR or real-time PCR analysis (Supplemental 
Table S2). Control amplification was carried out on input RNA 
before immunoprecipitation. 

TOP-FIash WNT reporter 

Cells were transfected with 250 ng of the TOP-FLASH or FOP- 
FLASH reporter constructs together with 25 ng of the Renilla 
luciferase vector. Luciferase activity was measured by the Dual- 
Luciferase Reporter Assay System (Promega) 48 h after transfection, 
and TOP or FOP values were normalized to Renilla values. The TOP/ 
FOP ratios were calculated and used as indicators of the endoge- 
nous level of WNT signaling. 

Immunofluorescent assay 

Cells were seeded on a 96-well plate. After 48 h, the cells were 
permeabilized and fixed in 4% paraformaldehyde in PBS buffer at 
room temperature for 15 min. The cells were then incubated with 
anti-vimentin (V9, Novus) overnight and finally incubated with 
secondary antibody and DAPI for 1 h. All matched samples were 
photographed using an immunofluorescence microscope and 
identical exposure times. Each experiment was performed in trip- 
licate and repeated three times. 

Allele quantification by Pyrosequencing 

We used the Pyrosequencing technology (QIAGEN) to measure the 
frequency of the G and T alleles of the CCAT2 transcript. The 
principles of the method and analysis have been previously 
reported (Wang and Elbein 2007). In our study, nested PCR re- 
actions were done for each sample. The first PCR was performed 
using the primers 1-F (TTTAGCAGCTGCATCGCTCCATAG) and 
2-R (CTCCCTCCCCCACATAAAAT). The second PCR combined 
a universal biotinylated primer for subsequent single-strand 
DNA capture for the Pyrosequencing reaction. One microliter 
of the first PCR was used for the nested amplification, and a 
reaction was performed to capture the forward strand using 
the primers SNP/F1 (GGG ACGAATAAACTCTCCTCCTAC) , SNP/ 
RU1 (GGGACACCGCTGATCGTTTACTCCCTCCCCCACATAAAAT), 
and the universal biotinylated primers UBP (Bio-GGGACACCG 
CTGATCGTTTA). An independent PCR reaction was performed to 
capture the reverse strand using the primers SNP/FU2 (GGGA 
CACCGCTGATCGTTTAGAGGTGTAGCCAGAGTTAATACCC), SNP/ 
R2 (CTCCCTCCCCCACATAAAAT), and UBP (Bio-GGGACACC 
GCTGATCGTTTA). After cleanup in a vacuum prep station, the 
single-strand DNA was combined with sequencing primers to read 
the forward (SNP/S1, AGCTCAGCAGATGAAA, and the sequence 
to analyze GG/TCACTG) and reverse (SNP/S2, TCTTTGTACTT 
TTCTCAGTG, and the sequence to analyze A/CCTTTCATC) strands 
of the target SNP. Each measurement was performed in duplicate, 
and the values from the forward and reverse strands was combined 
to give the frequency of the G and T alleles. 

Statistical analysis 

For the colorectal cancer samples, the statistical analyses were 
performed in R (version 2.14.2). The Shapiro-Wilk test was ap- 
plied to determine whether data followed a normal distribution. 
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Accordingly, the t-test or the nonparametric Mann-Whitney- 
Wilcoxon test was applied to assess the relationship between 
CCAT2 expression levels and clinical parameters. Fisher's exact test 
was used to assess the association between CCAT2 and the number 
of metastases in mice. For the breast cancer cohorts, data were 
analyzed using the STATA statistical package, release 9 (STATA 
Corp.) and SPSS 15.0 (IBM Corporation). We investigated the 
prognostic value of the clinical and biological variables with me- 
tastasis-free survival (MFS) as the end point using Cox univariate 
and multivariate regression analyses. CCAT2 RNA levels were an- 
alyzed as a log-transformed continuous variable and also, for vi- 
sualization in Kaplan-Meier survival analysis, as a dichotomized 
variable based on the median level of CCAT2 RNA, with the log- 
rank test used to evaluate differences. A Spearman's rank-order 
correlation test was applied to measure the strength of association 
between the expressions of CCAT2 and MYC in colorectal cancer 
samples and between CCAT2 RNA and CIN score in breast cancer 
samples. All tests were two sided, and P < 0.05 was considered 
statistically significant. 

Data access 

The CCAT2 RNA sequence reported in this paper has been de- 
posited in GenBank (http://www.ncbi.nlm.nih.gov/nuccore) 
under accession number GQ9 11591. 
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